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STEREO CODING AND DECODING METHODS AND APPARATUSES THEREOF 



The present invention relates to methods of coding data, for example to a 
method of coding audio and/or image data utilizing variable angle rotation of data 
components. Moreover, the invention also relates to encoders employing such methods, and 
to decoders operable to decode data generated by these encoders. Furthermore, the invention 
5 is concerned with encoded data communicated via data carriers and/or communication 
networks, the encoded data being generated according to the methods. 



Numerous contemporary methods are known for encoding audio and/or image 
10 data to generate corresponding encoded output data. An example of a contemporary method 
of encoding audio is MPEG-1 Layer III known as MP3 and described in ISO/IEC 
JTC1/SC29/WG1 1 MPEG, IS 1 1 172-3, Information Technology - Coding of Moving 
Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s, Part 3: 
Audio, MPEG-1, 1992. Some of these contemporary methods are arranged to improve coding 
15 efficiency, namely provide enhanced data compression, by employing mid/side (M/S) stereo 
coding or sum/difference stereo coding as described by J.D. Johnston and A.J. Ferreira, 
"Sum-difference stereo transform coding", in Proc. IEEE, Int. Conf. Acoust., Speech and 
Signal Proc, San Francisco, CA, March 1992, pp. II: pp. 569-572. 

In M/S coding, a stereo signal comprises left and right signals l[n], r[n] 
20 respectively which are coded as a sum signal m[n] and a difference signal s[n], for example 
by applying processing as described by Equations 1 and 2 (Eq. 1 and 2): 

m[n]= r[n] + l[n] Eq. 1 

s[n] = r[n]-l[n] Eq. 2 

25 When the signals l[n] and r[n] are almost identical, the M/S coding is capable 

of providing significant data compression on account of the difference signal s[n] 
approaching zero and thereby conveying relatively little information whereas the sum signal 
effectively includes most of the signal information content. In such a situation, a bit rate 
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required to represent the sum and difference signals is close to half that required for 
independently coding the signals l[n] and r[n]. 

Equations 1 and 2 are susceptible to being represented by way of a rotation 
matrix as in Equation 3 (Eq. 3): 



1>]J 



= c 




l[n] 

— 1 H"] ) 



Eq. 3 



wherein c is a constant scaling coefficient often used to prevent clipping. 

Whereas Equation 3 effectively corresponds to a rotation of the signals l[n], 
r[n] by an angle of 45°, other rotation angles are possible as provided in Equation 4 (Eq. 4) 
wherein a is a rotation angle applied to the signals l[n], r[n] to generate corresponding coded 
signals m f [n], s'[n] hereinafter described as relating to dominant and residual signals 
respectively: 
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Eq. 4 



The angle a is beneficially made variable to provide enhanced compression 
for a wide class of signals l[n], r[n] by reducing information content present in the residual 
signal s'[n] and concentrating information content in the dominant signal m f [n], namely 
minimize power in the residual signal s'[n] and consequently maximize power in the 
dominant signal m'[n]. 

Coding techniques represented by Equations 1 to 4 are conventionally not 
applied to broadband signals but to sub-signals each representing only a smaller part of a full 
bandwidth used to convey audio signals. Moreover, the techniques of Equations 1 to 4 are 
also conventionally applied to frequency domain representations of the signals l[n], r[n]. 

In a published US patent no. US 5, 621, 855, there is described a method of 
sub-band coding a digital signal having first and second signal components, the digital signal 
being sub-band coded to produce a first sub-band signal having a first q-sample signal block 
in response to the first signal component, and a second sub-band signal having a second q- 
sample signal block in response to the second signal component, the first and second sub- 
band signals being in the same sub-band and the first and second signal blocks being time 
equivalent. 

The first and second signal blocks are processed to obtain a minimum distance 
value between point representations of time-equivalent samples. When the minimum distance 
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value is less than or equal to a threshold distance value, a composite block composed of q 
samples is obtained by adding the respective pairs of time-equivalent samples in the first and 
second signal blocks together after multiplying each of the samples of the first block by 
cos(oc) and each of the samples of the second signal block by -sin(oc). 
5 Although application of the aforementioned rotation angle a is susceptible to 

eliminating many disadvantages of M/S coding where only a 45° rotation is employed, such 
approaches are found to be problematic when applied to groups of signals, for example stereo 
signal pairs, when considerable relative mutual phase or time offsets in these signals occur. 
The present invention is directed at addressing this problem. 

10 

An object of the present invention is to provide a method of encoding data. 

According to a first aspect of the present invention, there is provided a method 
of encoding a plurality of input signals (1, r) to generate corresponding encoded data, the 
15 method comprising steps of: 

(a) processing the input signals (1, r) to determine first parameters (92) describing 

at least one of relative phase difference and temporal difference between the signals (1, r), and 
applying these first parameters (q> 2 ) to process the input signals to generate corresponding 
intermediate signals; 

20 (b) processing the intermediate signals and/or the input signals (1, r) to determine 

second parameters describing rotation of the intermediate signals required to generate a 
dominant signal (m) and a residual signal (s), said dominant signal (m) having a magnitude or 
energy greater than that of the residual signal (s), and applying these second parameters to 
process the intermediate signals to generate the dominant (m) and residual (s) signals; 

25 (c) quantizing the first parameters, the second parameters, and encoding at least a 

part of the dominant signal (m) and the residual signal (s) to generate corresponding 
quantized data; and 

(d) multiplexing the quantized data to generate the encoded data. 

The invention is of advantage in that it is capable of providing for more 
30 efficient encoding of data. 

Preferably, in the method, only a part of the residual signal (s) is included in 
the encoded data. Such partial inclusion of the residual signal (s) is capable of enhancing data 
compression achievable in the encoded data. 
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More preferably, in the method, the encoded data also includes one or more 
parameters indicative of parts of the residual signal included in the encoded data. Such 
indicative parameters are susceptible to rendering subsequent decoding of the encoded data 
less complex. 

5 Preferably, steps (a) and (b) of the method are implemented by complex 

rotation with the input signals (l[n], r[n]) represented in the frequency domain (l[k], r[k]). 
Implementation of complex rotation is capable of more efficiently coping with relative 
temporal and/or phase differences arising between the plurality of input signals. More 
preferably, steps (a) and (b) are performed in the frequency domain or a sub-band domain. 

10 "Sub-band" is to be construed to be a frequency region smaller than a full frequency 
bandwidth required for a signal. 

Preferably, the method is applied in a sub-part of a full frequency range 
encompassing the input signals (1, r). More preferably, other sub-parts of the full frequency 
range are encoded using alternative encoding techniques, for example conventional M/S 

15 encoding as described in the foregoing. 

Preferably, the method includes an additional step after step (c) of losslessly 
coding the quantized data to provide the data for multiplexing in step (d) to generate the 
encoded data. More preferably, the lossless coding is implemented using Huffman coding. 
Utilizing lossless coding enables potentially higher audio quality to be achieved. 

20 Preferably, the method includes a step of manipulating the residual signal (s) 

by discarding perceptually non-relevant time- frequency information present in the residual 
signal (s), said manipulated residual signal (s) contributing to the encoded data (100), and 
said perceptually non-relevant information corresponding to selected portions of a spectro- 
temporal representation of the input signals. Discarding perceptually non-relevant 

25 information enables the method to provide a greater degree of data compression in the 
encoded data. 

Preferably, in step (b) of the method, the second parameters (a; IID, p) are 
derived by minimizing the magnitude or energy of the residual signal (s). Such an approach is 
computationally efficient for generating the second parameters in comparison to alternative 
30 approaches to deriving the parameters. 

Preferably, in the method, the second parameters (a; IID, p) are represented by 
way of inter-channel intensity difference parameters and coherence parameters (IID, p). Such 
implementation of the method is capable of providing backward compatibility with existing 
parametric stereo encoding and associated decoding hardware or software. 
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Preferably, in steps (c) and (d) of the method, the encoded data is arranged in 
layers of significance, said layers including a base layer conveying the dominant signal (m), a 
first enhancement layer including first and/or second parameters corresponding to stereo 
imparting parameters, a second enhancement layer conveying a representation of the residual 
5 signal (s). More preferably, the second enhancement layer is further subdivided into a first 
sub-layer for conveying most relevant time-frequency information of the residual signal (s) 
and a second sub-layer for conveying less relevant time-frequency information of the residual 
signal (s). Representation of the input signals by these layers, and sub-layers as required is 
capable of enhancing robustness to transmission errors of the encoded data and rendering it 
10 backward compatible with simpler decoding hardware. 

According to a second aspect of the invention, there is provided an encoder for 
encoding a plurality of input signals (1, r) to generate corresponding encoded data, the 
encoder comprising: 

(a) first processing means for processing the input signals (1, r) to determine first 
1 5 parameters (92) describing at least one of relative phase difference and temporal difference 

between the signals (1, r), the first processing means being operable to apply these first 
parameters (92) to process the input signals to generate corresponding intermediate signals; 

(b) second processing means for processing the intermediate signals to determine 
second parameters describing rotation of the intermediate signals required to generate a 

20 dominant signal (m) and a residual signal (s), said dominant signal (m) having a magnitude or 
energy greater than that of the residual signal (s), the second processing means being 
operable to apply these second parameters to process the intermediate signals to generate at 
least the dominant (m) and residual (s) signals; 

(c) quantizing means for quantizing the first parameters ((p2), the second 

25 parameters (a; IID, p), and at least a part of the dominant signal (m) and the residual signal 
(s) to generate corresponding quantized data; and 

(d) multiplexing means for multiplexing the quantized data to generate the 
encoded data. 

The encoder is of advantage in that it is capable of providing for more efficient 
30 encoding of data. 

Preferably, the encoder comprises processing means for manipulating the 
residual signal (s) by discarding perceptually non-relevant time- frequency information 
present in the residual signal (s), said transformed residual signal (s) contributing to the 
encoded data (100) and said perceptually non-relevant information corresponding to selected 
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portions of a spectro-temporal representation of the input signals. Discarding perceptually 
non-relevant information enables the encoder to provide a greater degree of data compression 
in the encoded data. 

According to a third aspect of the present invention, there is provided a 
5 method of decoding encoded data to regenerate corresponding representations of a plurality 
of input signals (F, r*), said input signals (1, r) being previously encoded to generate said 
encoded data, the method comprising steps of: 

(a) de-multiplexing the encoded data to generate corresponding quantized data; 

(b) processing the quantized data to generate corresponding first parameters ((p 2 ), 
10 second parameters, and at least a dominant signal (m) and a residual signal (s), said dominant 

signal (m) having a magnitude or energy greater than that of the residual signal (s); 

(c) rotating the dominant (m) and residual (s) signals by applying the second 
parameters to generate corresponding intermediate signals; and 

(d) processing the intermediate signals by applying the first parameters (92) to 

15 regenerate said representations of said input signals (1', r 1 ), the first parameters (92) describing 
at least one of relative phase difference and temporal difference between the signals (1, r). 

The method provides an advantage of being capable of efficiently decoding 
data which has been efficiently coding using a method according to the first aspect of the 
invention. 

20 Preferably, step (b) of the method includes a further step of appropriately 

supplementing missing time-frequency information of the residual signal (s) with a synthetic 
residual signal derived from the dominant signal (m). Generation of the synthetic signal is 
capable of resulting in efficient decoding of encoded data. 

Preferably, in the method, the encoded data includes parameters indicative of 

25 which parts of the residual signal (s) are encoded into the encoded data. Inclusion of such 

indicative parameters is capable of rendering decoding for efficient and less computationally 
demanding. 

According to a fourth aspect of the present invention, there is provided a 
decoder for decoding encoded data to regenerate corresponding representations of a plurality 
30 of input signals (F, r'), said input signals (1, r) being previously encoded to generate the 
encoded data, the decoder comprising: 

(a) de-multiplexing means for de-multiplexing the encoded data to generate 

corresponding quantized data; 
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(b) first processing means for processing the quantized data to generate 

corresponding first parameters (92), second parameters, and at least a dominant signal (m) 
and a residual signal (s), said dominant signal (m) having a magnitude or energy greater than 
that of the residual signal (s); 
5 (c) second processing means for rotating the dominant (m) and residual (s) signals 

by applying the second parameters to generate corresponding intermediate signals; and 
(d) third processing means for processing the intermediate signals by applying the 

first parameters (92) to regenerate said representations of the input signals (1, r), the first 
parameters (92) describing at least one of relative phase difference and temporal difference 

10 between the signals (1, r). 

Preferably, the second processing means is operable to generate a 
supplementary synthetic signal derived from the decoded dominant signal (m) for providing 
information missing from the decoded residual signal. 

According to a fifth aspect of the invention, there is provided encoded data 

1 5 generated according to the method of the first aspect of the invention, the data being at least 
one of recorded on a data carrier and communicable via a communication network. 

According to a sixth aspect of the invention, there is provided software for 
executing the method of the first aspect of the invention on computing hardware. 

According to a seventh aspect of the invention, there is provided software for 

20 executing the method of the third aspect of the invention on computing hardware. 

According to an eighth aspect of the invention, there is provided encoded data 
at least one of recorded on a data carrier and communicable via a communication network, 
said data comprising a multiplex of quantizing first parameters, quantized second parameters, 
and quantized data corresponding to at least a part of a dominant signal (m) and a residual 

25 signal (s), wherein the dominant signal (m) has a magnitude or energy greater than the 

residual signal (s), said dominant signal (m) and said residual signal (s) being derivable by 
rotating intermediate signals according to the second parameters, said intermediate signals 
being generated by processing a plurality of input signals to compensate for relative phase 
and/or temporal delays therebetween as described by the first parameters. 

30 It will be appreciated that features of the invention are susceptible to being 

combined in any combination without departing from the scope of the invention as defined in 
the accompanying claims. 
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Embodiments of the invention will now be described, by way of example only, with 
reference to the following diagrams wherein: 

Fig. 1 is an illustration of sample sequences for signals l[n], r[n] subject to 
relative mutual time and phase delays; 
5 Fig. 2 is an illustration of application of a conventional M/S transform 

pursuant to Equations 1 and 2 applied to the signals of Fig. 1 to generate corresponding sum 
and difference signals m[n], s[n]; 

Fig. 3 is an illustration of application of a rotation transform pursuant to 
Equation 4 applied to the signals of Fig. 1 to generate corresponding dominant m[n] and 
10 residual s[n] signals; 

Fig. 4 is an illustration of application of a complex rotation transform 
according to the invention pursuant to Equations 5 to 15 to generate corresponding dominant 
m[n] and residual s[n] signals wherein the residual signal is of relatively small amplitude 
despite the signals of Fig. 1 having relative mutual phase and time delay; 
15 Fig. 5 is a schematic diagram of an encoder according to the invention; 

Fig. 6 is a schematic diagram of a decoder according to the invention, the 
encoder being compatible with the encoder of Fig. 5; 

Fig. 7 is a schematic diagram of a parametric stereo decoder; 

Fig. 8 is a schematic diagram of an enhanced parametric stereo encoder 
20 according to the invention; and 

Fig. 9 is a schematic diagram of an enhanced parametric stereo decoder 
according to the invention, the decoder being compatible with the encoder of Fig. 9. 



25 In overview, the present invention is concerned with a method of coding data 

which represents an advance to M/S coding methods described in the foregoing employing a 
variable rotation angle. The method is devised by the inventors to be better capable of coding 
data corresponding to groups of signals subject to considerable phase and/or time offset. 
Moreover, the method provides advantages in comparison to conventional coding techniques 

30 by employing values for the rotation angle a which can be used when the signals l[n], r[n] 
are represented by their equivalent complex-valued frequency domain representations l[k], 
r[k] respectively. 

The angle ex can be arranged to be real- valued and a real-valued phase rotation 
applied to mutually "cohere" the l[n], r[n] signals to accommodate mutual temporal and/or 



WO 2005/098825 



PCT/IB2005/051058 



9 

phase delays between these signals. However, use of complex values for the rotation angle a 
renders the present invention easier to implement. Such an alternative approach to 
implementing rotation by angle a is to be construed to be within the scope of the present 
invention. 

5 Frequency-domain representations of the aforesaid time-domain signals l[n], 

r[n] are preferably derived by applying a temporal windowing procedure as described by 
Equations 5 and 6 (Eq. 5 and 6) to provide windowed signals lq[n], r q [n]: 

l,[n] = l[n + qH]h[n] Eq. 5 



10 r q [n]=r[n + qH\h[n\ Eq. 6 

wherein 

q = a frame index such that q = 0, 1,2, ... to indicate consecutive signal frames; 

H = a hop-size or update-size; and 

n = a time index having a value in a range of 0 to L-l wherein a parameter L is 

15 equivalent to the length of a window h[n]. 

The windowed signals lq[n], r q [n] are transformable to the frequency domain 
by using a Discrete Fourier Transform (DFT), or functionally equivalent transform, as 
described in Equations 7 and 8 (Eq. 7 and 8): 



20 



*]-g/>W-y2fO 



Eq. 7 



r[k] = ^r [n\exp( - Eq. 8 

wherein a parameter N represents a DFT length such that N > L . On account of the DFT of a 
real- valued sequence being symmetrical, only the first N / 2 +l points are preserved after the 
transform. In order to preserve signal energy when implementing the DFT, the following 
25 scaling as described in Equations 9 and 10 (Eq. 9 and 10) is preferably employed: 

/[0]=^1 Eq.9 

r[0] = ^|! Eq.10 

The method of the invention performs signal processing operations as depicted 
30 by Equation 1 1 (Eq. 1 1) to convert the frequency domain signal representations l[k], r[k] in 
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Equations 7 and 8 to corresponding rotated sum and difference signals m"[k], s M [k] in the 
frequency domain: 

r m"[k]\_( cos(a) sin (ex ^ 
^"[k] J ^-sin(a) cos(c 



(a)Ye'» 
(a)[ 0 



0 



Eq. 11 



wherein 
a = 

(pi = 



real- valued variable rotation angle; 

a common angle used to maximise the continuation of signals over associated 
boundaries; and 

cp 2 = an angle used to minimize the energy of the residual signal s"[k] by phase- 

rotating the right signal r[k]. 

Use of the angle <pi is optional. Moreover, rotations pursuant to Equation 1 1 
are preferably executed on a frame-by-frame basis, namely dynamically in frame steps. 
However, such dynamic changes in rotation from frame-to- frame can potentially cause signal 
discontinuities in the sum signal m"[k] which can be at least partially removed by suitable 
selection of the angle <pi. 

Furthermore, the frequency range k = 0 ... N /2+l of Equation 1 1 is preferably 
divided into sub-ranges, namely regions. For each region during encoding, its corresponding 
angle parameters a, (pi and cp 2 are then independently determined, coded and then transmitted 
or otherwise conveyed to a decoder for subsequent decoding. By arranging for the frequency 
range to be sub-divided, signal properties can be better captured during encoding resulting 
potentially in higher compression ratios. 

After implementing mappings pursuant to Equations 7 to 1 1, the signals m"[k], 
s"[k] are subjected to an inverse Discrete Fourier Transform as described in Equations 12 and 
13 (Eq. 12 & 13): 



Eq. 12 



wherein 

mq[n] = dominant time-domain representation; and 

s q [n]= residual (difference) time-domain representation. 
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The dominant and residual representations are then converted in the method to 
representations on a windowed basis to which overlap is applied as provided by processing 
operations as described by Equations 14 and 15 (Eq. 14 and 15): 

m[n + qH] = m[n + qH]+2Re{m q [n\h[n]\ Eq. 14 

5 

s[n + qH] = s[n + qH] + 2Re{s q [n\h[n]\ Eq. 15 

Alternatively, processing operations of the method of the invention as 
described by Equations 5 to 15 are susceptible, at least in part, to being implemented in 
practice by employing complex-modulated filter banks. Digital processing applied in 
10 computer processing hardware can be employed to implement the invention. 

In order to illustrate the method of the invention, a signal processing example 
of the invention will now be described. For the example, two temporal signals are used as 
initial signals to be processed using the method, the two signals being defined by Equations 
16 and 17 (Eq. 16 and 17): 
15 /[/7]=0.5cos(0.32« + 0.4)+0.05.z 1 [«] + 0.06.z 2 H Eq. 16 



r[n] = 0.25cos(0.32« + 1.8)+ 0.03.Z, [w] + 0.05.z 3 [n] Eq. 17 

wherein Zi[n], z 2 [n] and Z3[n] are mutually independent white noise sequences of unity 
variance. In order to better appreciate operation of the method of the invention, portions of 

20 the signals l[n], r[n] described by Equations 16 and 17 are shown in Fig. 1. 

In Fig. 2, M/S transform signals m[n] and s[n] are illustrated, these transform 
signals being derived from the signals l[n],r[n] of Equations 16 and 17 by conventional 
processing pursuant to Equations 1 and 2. It will be seen from Fig. 2 that such a conventional 
approach to generating the signals m[n] and s[n] from the signals of Equations 16 and 17 

25 results in the energy of the residual signal s[n] being higher than the energy of the input 

signal r[n] in Equation 17. Clearly, conventional M/S transform signal processing applied to 
the signals of Equations 16 and 17 is ineffective at resulting in signal compression because 
the signal s[n] is not of negligible magnitude. 

By employing a rotation transform as described by Equation 4, it is possible 

30 for the example signals l[n], r[n] to reduce the residual energy in their corresponding residual 
signal s[n] and correspondingly enhance their dominant signal m[n] as illustrated in Fig. 3. 
Although the rotation approach of Equation 4 is capable of performing better than 
conventional M/S processing as presented in Fig. 2, it is found by the inventors to be 
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unsatisfactory when the signals l[n], r[n] are subject to relative mutual phase and/or time 
shifts. 

When the sample signals l[n], r[n] of Equations 16 and 17 are subjected to 
transformation to the frequency domain, then subjected to a complex optimizing rotation 
5 pursuant to the Equations 5 to 15, it is feasible to reduce the energy of the residual signal s[n] 
to a comparatively small magnitude as illustrated in Fig. 4. 

Embodiments of encoder hardware operable to implement signals processing 
as described by Equations 5 to 1 5 will next be described. 

In Fig. 5, there is shown an encoder according to the invention indicated 

10 generally by 10. The encoder 10 is operable to receive left (1) and right (r) complementary 

input signals and encode these signals to generate an encoded bit-stream (bs) 100. Moreover, 
the encoder 10 includes a phase rotation unit 20, a signal rotation unit 30, a time/frequency 
selector 40, a first coder 50, a second coder 60, a parameter quantizing processing unit (Q) 70 
and a bit-stream multiplexer unit 80. 

1 5 The input signals 1, r are coupled to inputs of the phase rotation unit 20 whose 

corresponding outputs are connected to the signal rotation unit 30. Dominant and residual 
signals of the signal rotation unit 30 are denoted by m, s respectively. The dominant signal m 
is conveyed via the first coder 50 to the multiplexer unit 80. Moreover, the residual signal s is 
coupled via the time/frequency selector 40 to the second coder 60 and thereafter to the 

20 multiplexer unit 80. Angle parameter outputs (pi, q>2 from the phase rotation unit 20 are 
coupled via the processing unit 70 to the multiplexer unit 80. Additionally, an angle 
parameter output a is coupled from the signal rotation unit 30 via the processing unit 70 to 
the multiplexer unit 80. The multiplexer unit 80 comprises the aforementioned encoded bit 
stream output (bs) 100. 

25 In operation, the phase rotation unit 20 applies processing to the signals 1, r to 

compensate for relative phase differences therebetween and thereby generate the parameters 
<pi, cp 2 wherein the parameter (p 2 is representative of such relative phase difference, the 
parameters cp t , cp 2 being passed to the processing unit 70 for quantizing and thereby including 
as corresponding parameter data in the encoded bit stream 100. The signals 1, r compensated 

30 for relative phase difference pass to the signal rotation unit 30 which determines an optimized 
value for the angle a to concentrate a maximum amount of signal energy in the dominant 
signal m and a minimum amount of signal energy in the residual signal s. The dominant and 
residual signals m, s then pass via the coders 50, 60 to be converted to a suitable format for 
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inclusion in the bit stream 100. The processing unit 70 receives the angle signals a, (pi, (p2 
and multiplexes them together with the output from the coders 50, 60 to generate the bit- 
stream output (bs) 100. Thus, the bit-stream (bs) 100 thereby comprises a stream of data 
including representations of the dominant and residual signals m, s together with angle 
5 parameter data a, q>i, cp 2 wherein the parameter q> 2 is essential and the parameters (pi are 
optional but nevertheless beneficial to include. 

The coders 50, 60 are preferably implemented as two mono audio encoders, or 
alternatively as one dual mono encoder. Optionally, certain parts of the residual signal s, for 
example identified when represented in a time-frequency plane, not perceptibly contributing 

10 to the bit stream 100 can be discarded in the time/frequency selector 40, thereby providing 
scalable data compression as elucidated in more detail below. 

The encoder 10 is optionally capable of being used for processing the input 
signals (1, r) over a part of a full frequency range encompassing the input signals. Those parts 
of the input signals (1, r) not encoded by the encoder 10 are then in parallel encoded using 

15 other methods, for example using conventional M/S encoding as described in the foregoing. 
If required individual encoding of left (1) and right (r) input signals can be implemented if 
required. 

The encoder 10 is susceptible to being implemented in hardware, for example 
as an application specific integrated circuit or group of such circuits. Alternatively, the 

20 encoder 10 can be implemented in software executing on computing hardware, for example 
on a proprietary software-driven signal processing integrated circuit or group of such circuits. 

In Fig. 6, a decoder compatible with the encoder 10 is indicated generally by 
200. The decoder 200 comprises a bit-stream demultiplexer 210, first and second decoders 
220, 230, a processing unit 240 for de-quantizing parameters, a signal rotation decoder unit 

25 250 and a phase rotation decoding unit 260 providing decoded outputs T, r' corresponding to 
the input signals 1, r input to the encoder 10. The demultiplexer 210 is configured to receive 
the bit-steam (bs) 100 as generated by the encoder 10, for example conveyed from the 
encoder 10 to the decoder 200 by way of a data carrier, for example an optical disk data 
carrier such as a CD or DVD, and/or via a communication network, for example the Internet. 

30 Demultiplexed outputs of the demultiplexer 210 are coupled to inputs of the decoders 220, 
230 and to the processing unit 240. The first and second decoders 220, 230 comprise 
dominant and residual decoded outputs m', s* respectively which are coupled to the rotation 
decoder unit 250. Moreover, the processing unit 240 includes a rotation angle output oc f 
which is also coupled to the rotation decoder unit 250; the angle a' corresponds to a decoded 



WO 2005/098825 PCT/IB2005/051058 

14 

version of the aforementioned angle a with regard to the encoder 10. Angle outputs qV, cp 2 ' 
correspond to decoded versions of the aforementioned angles (pi, (p 2 with regard to the 
encoder 10; these angle outputs (pi \ q> 2 ' are conveyed, together with decoded dominant and 
residual signal outputs from the rotation decoder unit 250 to the phase rotation decoding unit 
260 which includes decoded outputs l f , r' as illustrated. 

In operation, the decoder 200 performs an inverse of encoding steps executed 
within the encoder 10. Thus, in the decoder 200, the bit-stream 100 is demultiplexed in the 
demultiplexer 210 to isolate data corresponding to the dominant and residual signals which 
are reconstituted by the decoders 220, 230 to generate the decoded dominant and residual 
signals m\ s f . These signals m', s* are then rotated according to the angle a' and then corrected 
for relative phase using the angles (pi', (p 2 ' to regenerate the left and right signals 1', r\ The 
angles <pi f , q> 2 ', a' are regenerated from parameters demultiplexed in the demultiplexer 210 
and isolated in the processing unit 240. 

In the encoder 10, and hence also in the decoder 200, it is preferable to 
transmit in the bit-stream 1 00 an IID value and a coherence value p rather than the 
aforementioned angle a. The IID value is arranged to represent an inter-channel difference, 
namely denoting frequency and time variant magnitude differences between the left and right 
signals 1, r. The coherence value p denotes frequency variant coherence, namely similarity, 
between the left and right signals 1, r after phase synchronization. However, for example in 
the decoder 200, the angle a is readily derivable from the IID and p values by applying 
Equation 18 (Eq. 18): 



a — — arctan 
2 



/ IJD \ 

210 20 p 



up 
10 10 -1 



Eq. 18 



A parametric decoder is indicated generally by 400 in Fig. 7, this decoder 400 
being complementary to the encoders according to the present invention. The decoder 400 
comprises a bit-stream demultiplexer 410, a decoder 420, a de-correlation unit 430, a scaling 
unit 440, a signal rotation unit 450, a phase rotation unit 460 and a de-quantizing unit 470. 
The demuliplexer 410 comprises an input for receiving the bit-stream signal (bs) 100 and 
four corresponding outputs for signal m, s data, angle parameter data, IID data and coherence 
data p, these outputs are connected to the decoder 420 and to the de-quantizer unit 470 as 
shown. An output from the decoder 420 is coupled via the de-correlation unit 430 for 
regenerating a representation of the residual signal s' for input to the scaling function 440. 
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Moreover, a regenerated representation of the dominant signal m' is conveyed from the 
decoder unit 420 to the scaling unit 440. The scaling unit 440 is also provided with IID' and 
coherence data p* from the de-quantizing unit 470. Outputs from the scaling unit 440 are 
coupled to the signal rotation unit 450 to generate intermediate output signals. These 
5 intermediate output signals are then corrected in the phase rotation unit 460 using the angles 
<pi', (p2 r decoded in the de-quantizing unit 470 to regenerate representations of the left and 
right signals 1', r f . 

The decoder 400 is distinguished from the decoder 200 of Fig. 6 in that the 
decoder 400 includes the decorrelation unit 430 for estimating the residual signal s' based on 
10 the dominant signal m' by way of decorrelation processes executed within the de-correlation 
unit 430. Moreover, the amount of coherence between the left and right output signals V, r ! is 
determined by way of a scaling operation. The scaling operation is executed within the 
scaling unit 440 and is concerned with a ratio between the dominant signal m' and the 
residual signal s'. 

15 Referring next to Fig. 8, there is illustrated an enhanced encoder indicated 

generally by 500. The encoder 500 comprises a phase rotation unit 510 for receiving left and 
right input signals 1, r respectively, a signal rotation unit 520, a time/frequency selector 530, 
first and second coders 540, 550 respectively, a quantizing unit 560 and a multiplexer 570 
including the bit-stream output (bs) 100. Angle outputs (pi, q>2 from the phase rotation unit 

20 510 are coupled from the phase rotation unit 510 to the quantizing unit 560. Moreover, 

phase-corrected outputs from the phase rotation unit 510 are connected via the signal rotation 
unit 520 and the time/frequency selector 530 to generate dominant and residual signals m, s 
respectively, as well as IID and coherence p data/parameters. The IID and coherence p 
data/parameters are coupled to the quantizer unit 560 whereas the dominant and residual 

25 signals m, s are passed via the first and second coders 540, 550 to generate corresponding 
data for the multiplexer 570. The multiplexer 570 is also arranged to receive parameter data 
describing the angles <pi, <p2, the coherence p and the IID. The multiplexer 570 is operable to 
multiplex data from the coders 540, 550 and the quantizing unit 560 to generate the bit- 
stream (bs) 100. 

30 In the encoder 500, the residual signal s is encoded directly into the bit-stream 

100. Optionally, the time/frequency selector unit 530 is operable to determine which parts of 
the time/frequency plane of the residual signal s are encoded into the bit-stream (bs) 100, the 
unit 530 thereby determining a degree to which residual information is included the bit- 
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stream 100 and hence affecting a compromise between compression attainable in the encoder 
500 and degree of information included within the bit-stream 100. 

In Fig. 9, an enhanced parametric decoder is indicated generally by 600, the 
decoder 600 being complementary to the encoder 500 illustrated in Fig. 8. The decoder 600 
5 comprises a demultiplexer unit 610, first and second decoders 620, 640 respectively, a de- 
correlation unit 630, a combiner unit 650, a scaling unit 660, a signal rotation unit 670, a 
phase rotation unit 680 and the de-quantizing unit 690. The demultiplexer unit 610 is coupled 
to receive the encoded bit-stream (bs) 100 and provide corresponding demultiplexed outputs 
to the first and second decoders 620, 640 and also to the de-multiplexer unit 690. The 

10 decoders 620, 640 in conjunction with the de-correlation unit 630 and the combiner unit 650 
are operable to regenerate representations of the dominant and residual signals m\ s f 
respectively. These representations are subjected to scaling processes in the scaling unit 660 
followed by rotations in the signal rotation unit 670 to generate intermediate signals which 
are then phase rotated in the rotation unit 680 in response to angle parameters generated by 

15 the de-quantizing unit 690 to regenerate representations of the left and right signals V, r\ 

In the decoder 600, the bit-stream 100 is de-multiplexed into separate streams 
for the dominant signal m', for the residual signal s' and for stereo parameters. The dominant 
and residual signals m', s' are then decoded by the decoders 620, 640 respectively. Those 
spectral/temporal parts of the residual signal s' which have been encoded into the bit-stream 

20 100 are communicated in the bit-stream 100 either implicitly, namely by detecting "empty" 
areas in the time- frequency plane, or explicitly, namely by means of representative signalling 
parameters decoded from the bit stream 100. The de-correlation unit 630 and the combiner 
unit 650 are operable to fill empty time-frequency areas in the decoded residual signal s' 
effectively with a synthetic residual signal. This synthetic signal is generated by using the 

25 decoded dominant signal m' and output from the de-correlation unit 650. For all other time- 
frequency areas, the residual signal s is applied to construct the decoded residual signal s'; for 
these areas, no scaling is applied in the scaling unit 660. Optionally, for these areas, it is 
beneficial to transmit the aforementioned angle a in the encoder 500 instead of IID and 
coherence p data as data rate required to convey the single angle parameter a is less than 

30 required to convey equivalent IID and coherence p parameter data. However, transmission of 
the angle a parameter in the bit stream 100 instead of the IID and p parameter data renders 
the encoder 500 and decoder 600 non-backwards compatible with regular conventional 
Parametric Stereo (PS) systems which utilize such IID and coherence p data. 
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The selector units 40, 530 of the encoders 10, 500 respectively are preferably 
arranged to employ a perceptual model when selecting which time- frequency areas of the 
residual signal s need to be encoded into the bit-stream 100. By coding various time- 
frequency aspects of the residual signal s in the encoders 10, 500, it is possible to thereby 
5 achieve bit-rate scalable encoders and decoders. When layers in the bit-stream 100 are 

mutually dependent, coded data corresponding to perceptually most relevant time- frequency 
aspects are included in a base layer included in the layers, with perceptually less important 
data moved to refinement or enhancement layers included in the layers; "enhancement layer" 
is also referred to as being "refinement layer". In such an arrangement, the base layer 

10 preferably comprises a bit stream corresponding to the dominant signal m, a first 

enhancement layer comprises a bit stream corresponding to stereo parameters such as 
aforementioned angles a, (pi, (p2, and a second enhancement layer comprises a bit stream 
corresponding to the residual signal s. 

Such an arrangement of layers in the bit-stream data 100 allows for the second 

1 5 enhancement layer conveying the residual signal s to be optionally lost or discarded; 

moreover, the decoder 600 illustrated in Fig. 10 is capable of combining decoded remaining 
layers with a synthetic residual signal as described in the foregoing to regenerate a 
perceptually meaningful residual signal for user appreciation. Furthermore, if the decoder 
600 is optionally not provided with the second decoder 640, for example due to cost and/or 

20 complexity restrictions, it is still possible to decode the residual signal s albeit at reduced 
quality. 

Further bit rate reductions in the bit stream (bs) 100 in the foregoing are 
possible by discarding encoded angle parameters <pi, (p2 therein. In such a situation, the phase 
rotation unit 680 in the decoder 600 reconstructs the regenerated output signals l 1 , r* using a 

25 default rotation angles of fixed value, for example zero value; such further bit rate reduction 
exploits a characteristic that the human auditory system is relative phase-insensitive at higher 
audio frequencies. As an example, the parameters q>2 are transmitted in the bit stream (bs) 100 
and the parameters <pi are discarded therefrom for achieving bit rate reduction. 

Encoders and complementary decoders according to the invention described in 

30 the foregoing are potentially useable in a broad range of electronic apparatus and systems, for 
example in at least one of: Internet radio, Internet streaming, Electronic Music Distribution 
(EMD), solid state audio players and recorders as well as television and audio products in 
general. 
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Although a method of encoding the input signals (1, r) to generate the bit- 
stream 100 is described in the foregoing, and complementary methods of decoding the bit- 
stream 100 elucidated, it will be appreciated that the invention is susceptible to being adapted 
to encode more than two input signals. For example, the invention is capable of being 
adapted for providing data encoding and corresponding decoding for multi-channel audio, for 
example 5-channel domestic cinema systems. 

In the accompanying claims, numerals and other symbols included within 
brackets are included to assist understanding of the claims and are not intended to limit the 
scope of the claims in any way. 

It will be appreciated that embodiments of the invention described in the 
foregoing are susceptible to being modified without departing from the scope of the invention 
as defined by the accompanying claims. 

Expressions such as "comprise", "include", "incorporate", "contain", "is" and 
"have" are to be construed in a non-exclusive manner when interpreting the description and 
its associated claims, namely construed to allow for other items or components which are not 
explicitly defined also to be present. Reference to the singular is also to be construed to be a 
reference to the plural and vice versa. 



